Voice-Driven Computer Game in Noisy Environments
نویسندگان
چکیده
The paper describes the performance of a task-oriented continuous automatic speech recognition (ASR) system in the computer game interface in noisy conditions. First, the process of designing the ASR system for Polish, based on CMU Sphinx4, is presented. Then, the concept of the computer game called Rally Navigator is described. The experiments were first run for the clean speech, and then repeated with added environmental noise with various signal-to-noise (SNR) ratios. Results of experiments with clean speech show that as little as 15 minutes of audio material is enough to produce a highly effective single-speaker command-and-control ASR system for the computer game, providing the sentence recognition accuracy of 97.6%. Results of the tests under noisy conditions show that minor degradation of performance was observed in car environment, however, accuracy decreased severely for babble and factory noises for SNR below 20 dB.
منابع مشابه
Speech Emotion Recognition Based on Power Normalized Cepstral Coefficients in Noisy Conditions
Automatic recognition of speech emotional states in noisy conditions has become an important research topic in the emotional speech recognition area, in recent years. This paper considers the recognition of emotional states via speech in real environments. For this task, we employ the power normalized cepstral coefficients (PNCC) in a speech emotion recognition system. We investigate its perfor...
متن کامل#. A Methodology and a System for Adaptive Speech Recognition in a Noisy Environment Based on Adaptive Noise Cancellation and Evolving Fuzzy Neural Networks
Speech and signal processing technologies need new methods that deal with the problems of noise and adaptation in order for these technologies to become common tools for communication and information processing. This chapter is concerned with a method and a system for adaptive speech recognition in a noisy environment (ASN). A system based on the described method can store words and phrases spo...
متن کاملInterdependent Security Game Design over Constrained Linear Influence Networks
In today's highly interconnected networks, security of the entities are often interdependent. This means security decisions of the agents are not only influenced by their own costs and constraints, but also are affected by their neighbors’ decisions. Game theory provides a rich set of tools to analyze such influence networks. In the game model, players try to maximize their utilities through se...
متن کاملFuzzy-Logic Controller for Speaker-Independent Speech Recognition System in Computer Games
Computer games are now a part of modern culture. By using automatic speech recognition systems (ASRS), voice driven commands can be used to control the game, which can open up the possibility for people with disabilities and age related problems to be included in game communities and use the services offered. Conventional speech recognition systems however, do not support emotions, attitudes, t...
متن کاملVoice Activity Detector and Noise Trackers for Speech Recognition System in Noisy Environment
The well known fact is that the performance of the Speech Recognition System degrades drastically in Adverse Environments. Additive noise is one of the major element of adverse noisy environment. Detecting voiced, un-voiced or silent speech signal in noisy environment is not an easy task. A voice activity detector (VAD) is effective when the noise is stationary; it often fails when the noise st...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- IJCSA
دوره 10 شماره
صفحات -
تاریخ انتشار 2013